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APPROXIMATION  TECHNIQUES  FOR  PARAMETER  ESTIMATION  IN 
HEREDITARY  CONTROL  SYSTEMS 


H.  T.  Banks  and  I.  G.  Rosen 


Abstract 

Two  types  of  approximation  techniques  for  parameter  estimation 
of  delay  systems  are  described  and  compared.  One  involves  state 
discretization  only  while  the  other  entails  simultaneous  state  and 
time  discretization. 


We  consider  two  approximation  techniques  for  parameter 
identification  problems  for  delay  systems  of  the  form 

(1)  *(t)  «  Lxt  +  Bu(t)  0  <  t  <  T 

x(0)  ■  n  xQ  -  <P 

where  B  is  an  n  *  m  matrix,  u  is  an  Rm  valued  function 
that  is  piecewise  continuous  on  [0,TJ  (i.e.,  u  €  PC(0,T)), 

n  €  Rn  and  4  is  an  Rn  valued  function  that  is  square  integrable 
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on  (-r,0)  (i.e.,  41  £  L^C-r.O)).  For  x:[-r,T]  R11  measurable 

and  t  €  [0,T],  we  denote  by  xt  the  measurable  function  given 
by  xt(0)  *  x(t+0)  ,  -r  <  0  <  0.  The  linear  operator 
L:L2(-r,0)  R11  is  assumed  to  be  of  the  form: 

v  r  0 

(2)  ■  l  A.<K-t.)  +  A(s)<j>(s)ds 

j  =0  J  J  J-r 

where  0  =  Tq  <  x^  <  X2«..<  xv=  r,  Aj  ,  j  =  0,1,2. ..v,  are  n  x  n 
matrices  and  A  is  an  n  x  n  matrix  valued  function  which  is 
square  integrable  on  (-r,0).  Strictly  speaking  the  expression 
for  L  given  by  (2)  is  not  well  defined  for  all  <j>  €  L^OrjO)  in 
that  point  evaluations  of  $  are  required.  However,  with  the 
usual  interpretation  (see  [7])  the  system  (1)  has  well  defined 
solutions  for  all  (n,<0  €  Rn  *  L^C-TjO). 

The  basic  parameter  identification  problem  (PID)  is  one  of 
fitting  to  data  a  model  such  as  (1)  (where  u  is  a  fixed  input, 

-  Aj(q)  i  =  0 , 1 , 2 . . .  v,  A(  • )  ■  A(-,q)  and  B  =  B(q)  are  con- 

K 

tinuous  functions  of  some  parameter  q  €  R  )  by  choosing  parameter 
values  q  from  some  compact  set  &  c  R  and  initial  conditions 
n,4>  from  some  compact  Sf  contained  in  Rn  *  L2(-r,0).  To  be  more 
specific,  assume  that  one  is  given  5  €  Cn(0,T)  (Cn(0,T)  denoting 

the  collection  of  Rn  valued  continuous  functions  defined  on  (0,T)) 
that  represents  measurements  on  [0,T]  of  the  "observables'* 
c(t,q)  »  cx(t;q)  +  du(t)  for  (1)  where  t  x(t;q)  is  the  solution 
to  (1)  corresponding  to  a  value  q  £  (If,  as  usual,  only  dis- 
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Crete  measurements  are  made,  the  curve  5  may  be  constructed 
via  some  interpolation  scheme.)  The  problem  then  is  to  find  a 
value  y  »  (q,  (h,<F))  €  3  *  5/ such  that 

J(Y)  *  |c(0,q)  -  C(0)|J  +  I c (T,q)  -  5(T)|J 

W1  2 

T 

+  f  tc(s,q)  -  5(s)|^  ds 
JO  w3 

attains  a  minimum  on  2  at  y  =  y.  Here  w^w^vij  represent 

positive  definite  weighting  matrices. 

These  problems  are  infinite  dimensional  state  system  problems 
and  our  approach  involves  first  rewriting  (1)  as  an  equivalent 
ordinary  differential  equation  (ODE)  in  an  appropriately  chosen 
Hilbert  space  Z.  As  the  state  space  we  choose  Z  =  Rn  *  L^C-r.O) 
since  one  can  argue  equivalence  of  (1)  in  some  sense  (mild  solutions) 
to  the  abstract  ODE  in  Z  given  by 

z(t)  »ja^z(t)  +  (Bu(t)  ,0) 

with  initial  conditions  z(0)  *  Zq  *  (n,4>).  More  precisely,  taking 
x  as  the  solution  to  (1)  on  (0,»)  for  a  given  (n,<l>)  and  u  =  0 
we  define  the  homogeneous  solution  semigroup  (S(t):t  0)  by 

S(t)(n,4>)  -  (x(t;n,<J>),  xt(n,*)).  Then  (S(t):t  >  0}  is  a 
&q  semigroup  of  bounded  linear  operators  defined  on  Z  with 
infinitesimal  generator  defined  on  ■  ( (♦  (0)  ,♦) :  ♦  €  wif2  (*r»®)) 

by  .£*W0),$)  -  (L<M)  (see  [3],  171) . 


In  the  light  of  the  discussion  above,  we  are  able  to  obtain 
solutions  to  the  PID  governed  by  the  delay  system  (1)  via  solutions 
to  the  PID  formulated  in  the  Hilbert  space  Z  given  by 

(&)  Given  input  u  6  PCn(0,T)  and  observation  5  €  Cn(0,T), 
minimize 

J(Y)  -  J(q,z0)  *  |cz (0 ;y)  +  du(0)  -  5(0) |*  + 

T 

|cz(T;Y)  +  du(T)  -  e(T)|  *  ♦  f  |cz(s;Y)  ♦  du(s)  -  £(s)| l  ds 

w2  J  0  3 

over  r  *  @  *  SS  subject  to 

z(t)  -J^(q)z(t)  +  (B(q)u(t) ,0) 
z(0)  *  zQ. 

Approximate  solutions  to  the  PID  (^)  are  obtained  via  the 
following  two  techniques,  one  involving  discretization  in  the  state 
only,  the  other  involving  simultaneous  discretization  in  state  and 
time . 


Technique  1. 

Choose  a  sequence  of  finite  dimensional  approximating  sub¬ 
spaces  ZN  of  Z  and  let  PN  represent  the  orthogonal  projection 
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of  Z  onto  along  (Z^)-1-.  Define  the  operators  (q)  :  Z^  Z1^ 

in  such  a  way  that  they  appropriately  approximate  the  operator 
ja^tq)  on  Z  for  each  q  £  (see  [3],  [7])  and  consider  the 
sequence  of  approximating  PID  given  by 


Given  input  u  €  PCn(0,T)  and  observation  £  €  Cn(0,T) 
minimize : 

JnCyn)  -  JN(q,Zo)  =  IcznC0;yn)  +  du(0)  -  eco)|J  + 

T 

|czN(T;yN)  ♦  du(T)  -  5(T)|2  +  f  |czN(s;YN)  +  du(s)  -  £(s)|*  ds 

w2  Wj 

over  rN  =  x  P Ny  subject  to 

zN(t)  »j*TN(q)zN(t)  +  PN(B(q)u(t),0) 


zN(0)  -  zg. 

For  each  N,  represents  a  PID  with  finite  dimensional  state 

constraint.  Standard  gradient  projection  and  conjugate  gradient 
minimization  techniques  for  optimization  problems  governed  by  ODE 
state  equations  (see  [10].)  may  be  employed  to  obtain  solutions. 


Technique  2. 

Choose  a  sequence  of  finite  dimensional  approximating  spaces 
ZN  (as  opposed  to  subspaces  as  was  the  case  in  Technique  1)  with 
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projection-like  mappings  ir^:Z  *  2^.  Define  operators 
ja^jgCqjrZ^  ■*  ZN  which  in  some  sense  approximate  _o/(q)  for  each 
q  €  (see  [9]).  Let  C(z)  and  D(z)  be  rational  function 
approximations  to  the  exponential  (RFAE)  ez,  and  consider  the 
following  sequence  of  approximating  PID: 


7  .  pN  pN 

0 3^)m  Given  input  (ur,)  eV  R  and  observation 
N  N  j=0  0 

.  pN  PN  /  . 

UN}j_0  K  =  u(^*  =  j  =  0,1,2,. ..pN, 


where  p  is  that  positive  integer  for  which 
pr  5  T  <  (p+l)r^  minimize: 


jn(yn)  *  jn^*z0N}  *  lcNZN^YN^  +  duN  '  ^Wj  + 
I^NzNN^YN^  +  duNN"  ^NN'w2  ♦JjIq  I^NZN^V  +  duN  N^w3 


over  rN  *  subject  to 


*  C(j^O^(q))zN  +  j^D  (fj  (<l)  )  Bjj(q)  u^ ,  k  =  0,1,2.  ..pN-1 


z 


0 

N 


Z0N 


where  BN(q)n  =  ^N(B(q)n,0)  for  n  €  Rn. 

2 

For  each  N,  is  a  PID  governed  by  a  difference  equation 
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and  can  be  solved  via  standard  numerical  methods  which  are 
readily  available  (see  [10]).  Under  appropriate  conditions  on  ZN, 
PN,  -Q?,N(q),in  Technique  1  (see  [5])  and  on  ZN,  ttn,  j*?N(q),  C(z), 
D(z)  in  the  case  of  Technique  2  (see  [9]),  the  following  results 
may  be  established. 


Theorem  1.  Suppose  >  (respectively  {y^}) 

solutions  to  the  approximate  problems 

(J3f^)i).  Then  there  exists  a  y  =  (q,zft)  £  T 
N  N  Nu  N 

{yN  >  =  UqN  ,z0N  )}  and  (y  k)  =  { (q  k,z0k)} 
lc  k  k 


is  a  sequence  of 
(respectively 
and  subsequences 
such  that 


and 


_  _  v  _  *'• 

(a)  %  q  in  R  (q  ■*  q) 

_  \ 

(b)  \  z0Nk  *  z0  in  2  (z0  *  z0  in  Z) 


where  :  ZM  Z  denotes  the  Moore-Penrose  generalized  inverse 

k  k 

of  irM  (see  [8]).  Moreover,  in  both  cases  y  will  be  a  solution 
Nk 

to  problem  (^).  If  the  solution  to  problem  ( ^)  is  unique, 
then  the  sequences  {y^},  {y^}  themselves  converge  to  y  in  the 
above  sense. 


In  both  Techniques  1  and  2,  schemes  satisfying  the  conditions 

required  to  establish  the  veracity  of  the  above  theorem  may  be 

N  N  N 

realized  via  the  construction  of  Z  ,p  (q)  or  ZN,  ttn,  J*N(q) 
based  upon  finite  difference  or  spline  approximations  (see  [ 3] , [7] , [ 9] ) . 


The  RFAE  C(z)  and  D(z)  can  be  chosen  from  among  a  subclass 
of  the  Pade  approximations  to  ez  (see  [9]).  Proofs  of  the 
convergence  stated  in  the  theorem  rely  heavily  upon  results  from 
functional  analysis  and  linear  semigroup  theory. 

We  have  tested  the  above  techniques  on  a  number  of  examples 
(see  [4]  for  extensive  tests  of  Techique  1)  using  the  "averaging"  and 
"spline"  state  approximations  of  [3]  and  [7]  respectively.  We  present 
here  a  typical  example  to  illustrate  and  compare  results  for  the  two 
techniques . 

Example 

We  consider  the  scalar  equation 


*(t)  =  aQx(t)  +  a1x(t-l)  +  u(t) 

with  step  input  u  =  x  [  ^  ,  initial  data  x(0)  =1,  -1  <  6  5  0,  and 

observations  c(t)  =  x(t).  "Data"  5  was  generated  on  [0,2]  by 
integrating  the  equation  exactly  with  true  parameter  values  a^  =  .05 
and  a*  =  -4.0.  (The  techniques  were  also  tested  with  "data"  generated 
by  adding  random  noise  to  the  true  solution.  The  resulting  parameter 
estimates  obtained  are  essentially  unchanged  from  those  obtained  using 
"data"  without  noise.)  For  several  values  of  approximation  level  N, 
iterative  techniques  with  startup  values  a^’0  ■  .03,  a^’°  =  -3.0  were 
used  to  solve  the  approximating  problems  Co^)N  and  (_q^)n  corre¬ 
sponding  to  this  "data"  5  on  [0,2],  The  results  obtained  are 
presented  in  tabular  form  below.  In  both  techniques  the  AVE  refers  to 
the  "averaging"  state  approximation  of  [3]  while  SPL  1  refers  to  the 
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piecewise  linear  spline  state  approximations  of  [7].  In  Technique  2, 
we  used  the  Pade  RFAE 

C(z)  =  P22(z)  =  (1  +  \  z  +  z2/12)/(l  -  \  z  +  z2/12) 

and 

D(z)  =  P2Q(j  z)  =  (1  -  i  z  +  z2/8) " 1 

which  correspond  to  time  discretization  schemes  of  approximation 
index  q  =  4  (see  [9]  for  details  and  further  discussions). 

TECHNIQUE  1 


AVE  SPL  1 


N 

aN 

a0 

al 

2  O 
Irt 

aN 

al 

2 

1.0869 

-4.6236 

.0995 

-4.1639 

4 

.6525 

-4.3160 

.0417 

-4.0523 

8 

.3825 

-4.1660 

.0439 

-4.0222 

16 

.2245 

-4.0898 

.0449 

-4.0151 

32 

.1384 

-4.0505 

.0454 

-4.0133 

TRUE 

VALUES 

.0500 

-4.0000 

.0500 

-4.0000 

TECHNIQUE 

_2 

AVE 

SPL  1 

N 

a0N  alN 

a0N 

alN 

2  1 

.1742  -4.9316 

.1564 

-4.3972 

4 

.6325  -4.4671 

.0045 

-4.1682 

8 

.3368  -4.2838 

-.0121 

-4.1307 

16 

.2104  -4.1252 

.0292 

-4.0488 

32 

.1400  -4.0460 

.0474 

-4.0089 

TRUE 

VALUES 


0500 


-4.0000 


0500 


-4.0000 
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The  results  obtained  from  our  comparisons  of  Techniques  1  and  2  j 

! 

on  a  number  of  examples  reveal  that  when  the  approximations  reprented  by 

C(ij  and  D(^  in  are  of  an  order  comparable  to  I 

that  of  the  scheme  used  in  solving  the  ordinary  differential  equat  ion 

for  zN  in  ( j the  parameter  estimates  obtained  are  comparable. 

The  implementation  of  Technique  2,  which  involves  a  simple  difference 

equation  approximation  for  the  delay  equation,  is,  in  many  cases,  quite 

simple.  For  both  techniques  our  studies  indicate  that  the  first  order 

spline  methods  of  [7]  lead  to  schemes  that  are  superior  in  many 

situations  to  those  based  on  the  averaging  methods  of  [3]  (see  [4]  and 

[9]).  Of  course,  when  one  employs  the  cubic  spline  approximations  of 

[7]  (e.g.,  see  [9])  even  more  impressive  results  are  obtained. 

The  techniques  discussed  in  this  paper  have  also  been  used  with 

success  for  parameter  identification  in  problems  where  the  delays 

T1,T2,..*,TV  =  r  themselves  are  among  the  parameters  to  be  estimated 

(see  [1],  [4],  [6]  for  Technique  1;  numerical  results  for  Technique  2 

in  this  case  along  with  proofs  will  appear  in  a  forthcoming  manuscript) 

even  in  situations  where  does  not  exist  (for  discussions  of  this 

i 

and  its  significance,  see  [1]).  Finally,  the  methods  are  also 
applicable  to  certain  classes  of  nonlinear  parameter  estimation  and 
control  problems  (see  [2],  [6],  [9]). 
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